Dominant Feature Vectors Based Audio Similarity Measure

نویسندگان

  • Jing Gu
  • Lie Lu
  • Rui Cai
  • HongJiang Zhang
  • Jian Yang
چکیده

This paper presents an approach to extracting dominant feature vectors from an individual audio clip and then proposes a new similarity measure based on the dominant feature vectors. Instead of using the mean and standard deviation of frame features in most conventional methods, the most salient characteristics of an audio clip are represented in the form of several dominant feature vectors. These dominant feature vectors give a better description of the fundamental properties of an audio clip, especially when frame features change a lot along the time line. Evaluations on a content-based audio retrieval system indicate an obvious improvement after using the proposed similarity measure, compared with some other conventional methods.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Music Style Classification: Towards the Detection of Perceptually Similar Music

The fundamental problem investigated is to develop a model space that automatically classifies musics into categories by the detection of perceptually similar patterns in music. The concept is based on a modular system consisting of three main stages. The first stage involves the preprocessing of the raw audio data with passing over by a number of independent feature extractors. Each feature ex...

متن کامل

A Robust Musical Audio Search Method Based on Diagonal Dynamic Programming Matching of Self-Similarity Matrices

We propose a new musical audio search method based on audio signal matching that can cope with key and tempo variations. The method employs the self-similarity matrix of an audio signal to represent a key-invariant structure of musical audio. And, we use dynamic programming (DP) matching of self-similarity matrices to deal with time variations. However, conventional DP-based sequence matching m...

متن کامل

Semiautomatic Benchmarking of Feature Vectors for Multimedia Retrieval

Modern Digital Library applications store and process massive amounts of information. Usually, this data is not limited to raw textual or numeric data – typical applications also deal with multimedia data such as images, audio, video, or 3D geometric models. For providing effective retrieval functionality, appropriate meta data descriptors that allow calculation of similarity scores between dat...

متن کامل

A Nonlinear Viseme Model for Triphone-Based Speech Synthesis

This paper presents a new learning-based approach to speech synthesis that achieves mouth movements with rich and expressive articulation for novel audio input. From a database of 3D triphone motions, our algorithm picks the optimal sequences based on a triphone similarity measure, and concatenates them to create new utterances that include coarticulation effects. By using a Locally Linear Embe...

متن کامل

Multi-band Approach of Audio So Empirical Mode Dec

This paper presents a content-based approach of audio source indexing without any prior knowledge about the sources. The empirical mode decomposition (EMD) scheme, capable of decomposing nonlinear and non-stationary signals into some bases, is employed to implement the sub-band approach of the audio discrimination technique. The feature vectors are derived from each of selected sub-bands of the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004